Michael R. Crusoe, et al. (2022): Methods Included: Standardizing Computational Reuse and Portability with the Common Workflow Language. Communications of the ACM 65(6)
Rudolf Wittner, et al. (2023): ISO 23494: Biotechnology – Provenance Information Model for Biological Specimen And Data. Lecture Notes in Computer Science 12839
Rafael Ferreira da Silva, et al. (2021): A Community Roadmap for Scientific Workflows Research and Development. 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS).
Farah Zaib Khan, et al. (2019): Sharing interoperable workflow provenance: A review of best practices and their practical application in CWLProv. GigaScience 8(11)
Eoghan Ó Carragáin, et al. (2019): RO-Crate: A lightweight approach to research object data packaging. RO-15 at Workshop on Research Objects (RO 2019), IEEE eScience 2019, 2019-09-24, San Diego, CA, USA.
Marjan Meurisse, et al. (2023): Federated causal inference based on real-world observational data sources: Application to a SARS-CoV-2 vaccine effectiveness assessment. BMC Medical Research Methodology 23:248
Ignacio Eguinoa, et al. (2023): BioHackEU22 Report: Enhancing Research Data Management in Galaxy and Data Stewardship Wizard by utilising RO-Crates. BioHackrXiv
Sebastian Beier, et al. (2024): BioHackEU23 report: Enabling continuous RDM using Annotated Research Contexts with RO-Crate profiles for ISA. BioHackrXiv
Stian Soiland-Reyes, et al. (2024): Practical webby FDOs with RO-Crate and FAIR Signposting: Experiences and lessons learned. International FAIR Digital Objects Implementation Summit 2024, Berlin, Germany, 2024-03-20/--21. Open Conference Proceedings 4 (submitted)
Oliver Woolland, et al. (2022): Incrementally building FAIR Digital Objects with Specimen Data Refinery workflows. Research Ideas and Outcomes 8:e94349
Can the promising FDO concept be realised using existing Web technology, taking into account the lessons learnt from the early Semantic Web developments and more recent Linked Data practices?
Can a more pragmatic use of Linked Data practices better implement Research Objects for a wider developer audience, by using familiar Web technologies and give lightweight recommendations?
Can a FAIR Digital Object approach for computational workflows unify machine-readable descriptions of Research Software, data and provenance, which can be consistently implemented by developers of different workflow management systems?
Section 5.2: The Specimen Data Refinery: A canonical workflow framework and FAIR Digital Object approach to speeding up digital mobilisation of natural history collections
Section 5.2: The Specimen Data Refinery: A canonical workflow framework and FAIR Digital Object approach to speeding up digital mobilisation of natural history collections
Alex Hardisty, et al. (2022): The Specimen Data Refinery: A canonical workflow framework and FAIR Digital Object approach to speeding up digital mobilisation of natural history collections. Data Intelligence 4(2)
In this chapter, we discuss the related work with respect to FAIR Digital Objects and Linked Data. We do so by looking through the lens of development of these technologies over time, including future directions.
To investigate RQ1 this chapter evaluates both Linked Data and FAIR Digital Object (FDO) as ways to realize the FAIR principles. Section 3.1 compares the two approaches as global distributed object systems, and discusses what lessons can be learnt across the communities, taking into consideration the history covered by section 2. Section 3.2 proposes how the FDO principles can be achieved using Linked Data standards, which is explored further in the following chapters.
This chapter introduces RO-Crate, a pragmatic method of packaging data alongside structured metadata that is inline with the FAIR principles. This has been implemented to investigate RQ2. Section 4.1 describes the RO-Crate purpose, community effort and tooling and demonstrates how RO-Crate has been applied. Section 4.2 shows how RO-Crate can be used to achieve the FDO principles covered in chapter 3. Section 4.3 contributes a formal definition of RO-Crate using first order logic.
In order to investigate RQ3, and considering important parts of the FAIR principles are Reuse and provenance, this chapter examines in closer details how FAIR Digital Objects and RO-Crate can be used with Computational Workflows. Section 5.1 proposes that tools in computational workflows, when wrapped as interoperable building blocks, can be considered as FAIR Digital Objects, with a use case from biomolecular simulation. Sections 5.2 and 5.3 explore how FDOs and Research Objects can be constructed incrementally using computational workflows, with a use case from specimen digitization in natural history collections. Section 5.4 presents a profile of RO-Crate to capture workflow execution provenance, with incremental granularity levels and six workflow engine implementations. Use cases include machine learning-aided tumour detection and compatibility with PROV approaches.